A TOPOLOGICAL CHARACTERIZATION OF KNOTS AND LINKS 
ARISING FROM SITE-SPECIFIC RECOMBINATION 



DOROTHY BUCK AND ERICA FLAPAN 

Abstract. We develop a topological model of knots and links arising from a single (or 
multiple processive) round(s) of recombination starting with an unknot, unlink, or (2, m)-
torus knot or link substrate. We show that all knotted or linked products fall into a single
family, and prove that the size of this family grows linearly with the cube of the minimum 
number of crossings. Additionally, we prove that the only possible products of an unknot 
substrate are either clasp knots and links or (2, m)-torus knots and links. Finally, in the 
(common) case of (2, m)-torus knot or link substrates whose products have minimal crossing 
number m + 1, we prove that the types of products are tightly prescribed, and use this to 
examine previously uncharacterized experimental data. 



1. Introduction 

Molecular biologists are interested in DNA knots and links, because they have been impli- 
cated in a number of cellular processes. The axis of DNA molecules can become knotted or 
linked as a result of many reactions, including replication and recombination. The wide vari- 
ety of DNA knots and links observed has made separating and characterizing these molecules a 
critical issue. Experimentally, this is most conclusively accomplished via electron microscopy 
[15]. However, this is a laborious and difficult process. Thus topological techniques, such 
as those presented here, can aid experimentalists in characterizing DNA knots and links by 
restricting the types of knots or links that can arise in a particular context. 

This work focuses on one such DNA knotting process, site-specific recombination, mediated 
by a protein, known as a site-specific recombinase. Site-specific recombination is important 
because of its key role in a wide variety of biological processes. (See [2] or e.g. [7] for 
more information). In addition, pharmaceutical and agricultural industries have become 
increasingly involved in genetically modifying organisms or testing whether a mutation in 
a particular gene leads to a disease. As a result, these industries are now interested in 
site-specific recombinases as tools for precisely manipulating DNA (e.g. [11]).

Site-specific recombination roughly has three stages. Two recombinase molecules first bind 
to each of two specific sites on one or two molecules of covalently closed circular DNA (known 
as the substrate) and then bring them close together. We shall refer to these DNA sites as the 
crossover sites. Next, the sites are cleaved, exchanged and resealed. The precise nature of this 
intermediary step is determined by which of the two recombinase subfamilies the particular 
protein belongs to (see Assumption 3 below for more details). And finally, the rearranged 
DNA, called the product, is released. Understanding precisely which knots and links arise 
during site-specific recombination can help understand the details of this process (e.g. [12]).

In this paper we begin by developing a model that predicts all possible knots and links which
can arise as products of a single round of recombination, or multiple rounds of (processive) 
recombination, starting with substrate(s) consisting of an unknot, an unlink, or a (2, m)-torus 
knot or link (denoted by T(2,m)). This model rigorously develops and extends ideas that 
we originally sketched in [3]. Of all knots and links, we have chosen to focus on T(2,m),
because T(2, m) are the most commonly occurring knots and links in DNA. Our model rests
on three assumptions that we justify biologically in [2]. Building on these assumptions, we
use knot theoretic techniques to prove that all products fall within a single family, illustrated 
in Figure 10. We then prove that the number of product knots and links predicted by our 
model grows linearly with the cube of the minimal crossing number. We further prove that 
the product knot or link type is tightly prescribed when the substrate is T(2,m) and the
product has minimal crossing number m + 1. Finally, we apply this new result to previously
uncharacterized experimental data.

This paper complements earlier work by Sumners, Ernst, Cozzarelli and Spengler [18], 
which used the tangle model [9] and several biologically reasonable assumptions to solve 
tangle equations. They then determined which 4-plat knots and links arise as a result of 
(possibly processive) site-specific recombination on the unknot for the serine subfamily of 
recombinases (see just before Assumption 3 for a discussion of the two subfamilies). For the 
particular case of the recombinase Gin, they considered the knots 3_1, 4_1, 5_2 or 6_1 as well as
unknotted substrates. Our paper goes further in several ways. In addition to allowing an
unknotted substrate for a generic recombinase, we allow substrates that are unlinks with one
site on each component, as well as any T(2,m). Also, our assumptions are based exclusively
on the biology of the recombination process. In particular, we do not assume the tangle 
model holds or that all products must be 4-plats. Allowing products which are not 4-plats 
is important because recombination has been seen to produce knots and links which are 
connected sums (see [2]). 

We will use the following terminology and notation. Let J denote a substrate which is 
either an unknot, an unlink, or T(2, m) (illustrated in Figure 4). We use the term recombinase
complex, B, to refer to the convex hull of the four bound recombinase molecules together with
the two crossover sites, and use the term recombinase-DNA complex to refer to B together
with the substrate J. If the recombinase complex meets the substrate in precisely the two
crossover sites then we say the recombinase complex is a productive synapse. In Figure 1, we
illustrate two examples where the recombinase complex B is a productive synapse, and one
where B is not. Finally, we let C = cl(R^3 − B), and consider B ∩ J and C ∩ J separately.

The structure of the paper is as follows. In Section 2, we state our three assumptions about 
the recombinase-DNA complex, and use our assumptions to determine the pre-recombinant 
and post-recombinant forms of B ∩ J. In Section 3, we characterize the forms of C ∩ J for each
of our substrates. In Section 4, we glue each of the post-recombinant forms of B ∩ J to each
form of C ∩ J to determine all possible knotted or linked products predicted by our model.
Finally in Section 5, we bound the size of this product family, and further limit product type 
in some special cases, by combining our model with results on minimal crossing number. 




Figure 1. The two examples on the left have a productive synapse and the 
one on the right does not. The crossover sites are highlighted. 

2. Our assumptions and B ∩ J

2.1. The three assumptions. We make the following three assumptions about the recombinase- 
DNA complex, which we state in both biological and mathematical terms. In [2] we provide 
experimental evidence showing that each of these assumptions is biologically reasonable. 

(Biological) Assumption 1: The recombinase complex is a productive synapse, and there 
is a projection of the crossover sites which has at most one crossing between the sites and no 
crossings within a single site. 
This is equivalent to: 

(Mathematical) Assumption 1: B ∩ J consists of two arcs and there is a projection of
B ∩ J which has at most one crossing between the two arcs, and no crossings within a single
arc.

Form B1    Form B2    Form B3

Figure 2. We fix a projection of J so that B ∩ J has one of these forms.

As a result of this assumption, we now fix a projection of J such that B ∩ J has one of the
forms illustrated in Figure 2.

(Biological) Assumption 2: The productive synapse does not pierce through a supercoil 
or a branch point in a nontrivial way. Also, no persistent knots are trapped in the branches 
of the DNA on the outside of the productive synapse. 

Assumption 2 implies that the recombinase-DNA complex cannot resemble either of the 
illustrations in Figure 3. 

In order to restate Assumption 2 mathematically, we first introduce some terminology. We 
shall use the term spanning surface to refer to a surface D, bounded by J, such that D is 
topologically equivalent to a disk, two disjoint disks, or a twisted annulus when J is an unknot. 
unlink, or T(2, m), respectively. Figure 4 gives some examples of the relationship between a
spanning surface D and the productive synapse B. Observe that in each of the illustrations
of Figure 4, D ∩ ∂B consists of two arcs. By Assumption 1, B contains precisely two arcs
of J = ∂D. Hence ∂B meets J in precisely four points. It follows that the intersection of
any spanning surface for J with ∂B contains exactly two arcs. What we mean by "B does not
pierce through a supercoil or a branch point in a nontrivial way" is that B does not pierce the
interior of every spanning surface for J (as in the left illustration in Figure 3). In general,
a spanning surface D is pierced by B if and only if D ∩ ∂B contains at least one circle in
addition to the required two arcs. For example, in the diagram on the left in Figure 3, no
matter how the spanning surface D is chosen, D ∩ ∂B contains at least one circle as well as
two arcs.

D ∩ ∂B contains a circle        C ∩ D is knotted

Figure 3. On the left, the productive synapse pierces through a supercoil
in a nontrivial way. On the right, a knot is trapped in the branches on the
outside of B.

J = unknot    J = unlink       J = T(2,m)
D = disk      D = two disks    D = twisted annulus

Figure 4. Examples of different substrates J and a spanning surface D
bounded by J.

Next, we explain what we mean by no persistent knots are trapped in the branches outside 
of B. Consider a planar surface together with a finite number of arcs whose endpoints are 
on the boundary of the surface (see the illustration on the left in Figure 5). We can obtain 
a surface bounded by a knot or link by replacing a neighborhood of each arc in the original 
surface by a half-twisted band and removing the top and bottom ends of the band. Figure 5 
illustrates how such a surface can be obtained from an annulus together with a collection of 
arcs defining the twists. Any surface obtained from a planar surface in this way is said to be 
a planar surface with twists. 

Figure 5. We obtain a planar surface with twists by replacing a neighborhood
of each arc by a half-twisted band. 

Suppose that D is a spanning surface for J. We say that D ∩ C is unknotted rel ∂B, if
there is an ambient isotopy of C pointwise fixing ∂B which takes D ∩ C to a planar surface
with twists, where the endpoints of the arcs defining the twists are disjoint from ∂B. For
example, D ∩ C is unknotted rel ∂B for each of the spanning surfaces in Figure 4. This is
not the case for the surfaces D ∩ C in Figure 3.

We now restate Assumption 2 mathematically as follows.

(Mathematical) Assumption 2: J has a spanning surface D such that D ∩ ∂B consists
of two arcs and D ∩ C is unknotted rel ∂B.

Site-specific recombinases fall into two families - the serine and tyrosine recombinases. 
Assumption 3 addresses the mechanism of recombination according to which subfamily the 
recombinase is in. While the overall reactions of the two families of recombinases are the 
same, they differ in their precise mechanism of cutting and rejoining DNA at the crossover 
sites. We explain more of the biological details in [2]. 

(Biological) Assumption 3: Serine recombinases perform recombination via the "subunit 
exchange mechanism." This mechanism involves making two simultaneous (double-stranded) 
breaks in the sites, rotating opposite sites together by 180° within the productive synapse
and resealing opposite partners. In processive recombination, each recombination event is 
identical. After recombination mediated by a tyrosine recombinase, there is a projection of 
the crossover sites which has at most one crossing. 

The mathematical restatement of Assumption 3 is almost identical to the biological state- 
ment. 

(Mathematical) Assumption 3: Serine recombinases cut each of the crossover sites and 
add a crossing within B between the cut arcs on different sites, then reseal. In processive 
recombination, all recombination events are identical. After recombination mediated by a 
tyrosine recombinase, there is a projection of the crossover sites which has at most one 
crossing. 

2.2. The forms of B ∩ J. As a result of Assumption 1, we fixed a projection of J such that
B ∩ J has Form B1, B2, or B3 (illustrated in Figure 2). It follows from Assumption 3 that
after n recombination events with serine recombinases, we have added a row of n identical
crossings. Thus after n recombination events our fixed projection of B ∩ J is isotopic fixing
∂B to one of the forms illustrated in Figure 6 (where the actual crossings can be positive,
negative, or zero).

Figure 6. After n recombination events with serine recombinases, each pre-recombinant
form of B ∩ J leads to the corresponding post-recombinant form.



Also for tyrosine recombinases, we know from Assumption 3 that after recombination there
exists a projection of B ∩ J with at most one crossing. We are working with the projection
of J which we fixed as a result of Assumption 1, and we cannot be sure that this particular
projection of B ∩ J will have at most one crossing. However our projection must be ambient
isotopic, fixing ∂B, to one of the forms illustrated in Figure 7. So without loss of generality
we will assume that the post-recombinant projection of B ∩ J has one of these forms.

Form B1    Form B2    Form B3    Form B4    Form B5

Figure 7. After recombination with tyrosine recombinases, the post-
recombinant projection of B ∩ J has one of these forms.



3. The possible forms of C ∩ J

Using Assumption 2, we now prove the following Lemma. 

Lemma 1. Suppose that Assumptions 1, 2, and 3 hold for a particular recombinase-DNA
complex where the substrate J is an unknot, unlink, or a T(2,m) knot or link. Then C ∩ J
has a projection with one of the forms illustrated in Figure 8 where p + q = m. Furthermore,
if C ∩ J has Form C4, then B ∩ J must have Form B1 in Figure 7.

Proof. We consider separate cases according to the knot or link type of J. 

Case 1: J is the unknot. 

In this case, by Assumption 2, we can choose a spanning surface D which is a disk such
that D ∩ ∂B is two arcs and D ∩ C is unknotted rel ∂B. Since D is a disk, the two arcs of
∂B ∩ D separate D such that one of B ∩ D and C ∩ D is a strip and the other is a pair of
disjoint disks. Furthermore, if C ∩ D is a strip it is not knotted. Thus, C ∩ D is either a
pair of disjoint disks or an unknotted twisted strip. It follows that C ∩ J is ambient isotopic,
pointwise fixing ∂B, to Form C1 or Form C2.

[Figure 8 panel labels, partially recoverable: unlink; same component; J = T(2,m)]

Figure 8. C ∩ J has a projection with one of these forms.

Case 2: J is the unlink.

In this case, we assume that one site is on each component of J (or else the substrate was
actually an unknot). Thus by Assumption 2, we can choose a spanning surface D which is a
pair of disjoint disks such that ∂B meets each disk of D in a single arc. Hence, B ∩ D and
C ∩ D are each a pair of disjoint disks. It follows that C ∩ J is ambient isotopic, pointwise
fixing ∂B, to Form C1.

Case 3: J = T(2,m).

In this case, by Assumption 2, we can choose a spanning surface D to be a twisted annulus
such that D ∩ ∂B is two arcs and D ∩ C is unknotted rel ∂B. We see as follows that there
are several ways the arcs D ∩ ∂B can lie in D.

Any circle in R^3 must cross a sphere an even number of times (possibly zero). In particular,
the circle A representing the core of the twisted annulus D must cross ∂B an even number
of times. Each point where A crosses ∂B is contained in D ∩ ∂B. Since the total number
of points in A ∩ ∂B is even and D ∩ ∂B consists of two arcs, either A must intersect each
of these two arcs an even number of times, or A must intersect each of the two arcs an odd
number of times. If each arc of D ∩ ∂B intersects the core A an odd number of times, then
each of these arcs cuts D into a strip. Hence the two arcs of D ∩ ∂B together cut D into a
pair of strips. If each arc of D ∩ ∂B intersects the core A an even number of times, then each
arc cuts off a disk from D. In this case, either the two arcs cut off disjoint disks in D, or one
of the disks is contained inside of the other. In this latter case, the two arcs form the edges
of a strip in D, on one side of which is a disk and on the other side of which is a twisted
annulus. The three forms of D ∩ ∂B are illustrated on the top of Figure 9. Note that the
illustration on the right may have one, rather than two, rows of twists. Since B ∩ J contains
at most one crossing, the component of D with almost all of the twists of T(2,m) must be
contained in C.

Since C ∩ D is unknotted rel ∂B, the abstract forms illustrated on the top of Figure 9
yield the corresponding forms of C ∩ D which are illustrated in the bottom of Figure 9 up
to isotopy fixing ∂B. Observe that when C ∩ J has Form C4, the projection of B ∩ J must
have Form B1 as illustrated. Also, in Form C3, while there may be twists to the left of B,
they are topologically insignificant, since they can be removed by rotating D ∩ C by some
multiple of π. Similarly, in Form C4, any twists which had occurred to the left of B can be
removed and added to the row of twists at the right by rotating D ∩ C by some multiple of
π. These rotations can occur while pointwise fixing B.

Thus the four forms of C ∩ J illustrated in Figure 8 are the only ones possible. □

Figure 9. These are the forms of C ∩ D when J = T(2, m).

4. Product knots and links predicted by our model 

In this section, we suppose that the substrate is an unknot, an unlink, or T(2,m) and that
all three of our assumptions hold for a particular recombinase-DNA complex. Then we prove
Theorems 1 and 2, which characterize all possible knotted or linked products brought about
by tyrosine recombinases and serine recombinases respectively. If the substrate is an unknot
or unlink we will also show that all nontrivial products are in the torus link family T(2,n)
or the clasp link family C(r, s) (i.e., consisting of one row of r crossings and a non-adjacent
row of s crossings). Note that C(r, ±2) is the well known family of twist knots and links. If
the substrate is T(2,m), then all products are in the family of knots and links illustrated in
Figure 10.




Figure 10. We show that all knotted and linked products are in this family. 

Observe that in Figure 10, p, q, r, and s can be positive, negative, or zero. Furthermore, by
letting p, q, r, and/or s equal 0 or 1 as appropriate, we obtain the five subfamilies illustrated
in Figure 11. Subfamily 3 is the family of pretzel knots or links K(p,q,r) with three
non-adjacent rows containing p crossings, q crossings, and r crossings respectively. Observe that
Subfamily 4 is a connected sum. However, if q = 0, r = 1, and s = −1, then it is a T(2,p)
together with an unlinked trivial component.







Figure 11. These subfamilies are contained in the family illustrated in Figure 10. 

Theorem 1. Suppose that Assumptions 1, 2, and 3 hold for a particular tyrosine recombinase-
DNA complex. If the substrate is an unknot then the only nontrivial products are T(2,n) or
C(2,n). If the substrate is an unlink, then the only nontrivial product is a Hopf link, T(2,2).
If the substrate is T(2,m), then all of the products are contained in the family illustrated in
Figure 10.

Proof. We saw that as a result of Assumption 3, after recombination with tyrosine recom-
binases, the fixed projection of B ∩ J is ambient isotopic fixing ∂B to one of the five forms
illustrated in Figure 7. Also, by Lemma 1, C ∩ J is ambient isotopic fixing ∂B to one of
the four forms illustrated in Figure 8. For each of the four forms of C ∩ J, the products of
recombination with tyrosine recombinases are obtained by replacing B with each of the five
post-recombinant forms of B ∩ J in Figure 7. The resulting products are illustrated in Figure
12. Recall that if J is the unlink then C ∩ J must have Form C1, and if J is an unknot then
C ∩ J must have Form C1 or C2. The theorem follows from Figure 12.

□ 

Theorem 2. Suppose that Assumptions 1, 2, and 3 hold for a particular serine recombinase-
DNA complex. If the substrate is an unknot then the only nontrivial products are T(2, n)
or C(p,q). If the substrate is an unlink, then the only nontrivial product is T(2,n). If the
substrate is T(2,m) then all products are in the family illustrated in Figure 10.

Proof. We saw that as a result of Assumption 3, after n recombination events with serine
recombinases, the fixed projection of B ∩ J is ambient isotopic fixing ∂B to Form n1 or n2,
illustrated in Figure 6. Also, by Lemma 1, C ∩ J is ambient isotopic fixing ∂B to one of the
four forms illustrated in Figure 8. We obtain the products of serine recombinase from each
of the forms of C ∩ J illustrated in Figure 8 by replacing B with each of Form n1 and Form
n2. The resulting products are illustrated in Figure 13. Note that when C ∩ J has Form C4,
then B ∩ J must have Form B1. Hence the post-recombinant form of B ∩ J must be of Form
n1. Recall again that if J is an unlink, then C ∩ J must have Form C1, and if J is an unknot
then C ∩ J must have Form C1 or C2. The theorem follows from Figure 13. □

Table 1 summarizes the results of Theorems 1 and 2. 
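
For readers who want to query these results programmatically, the content of Table 1 can be encoded as a small lookup. This is only an illustrative sketch: the names TABLE_1 and nontrivial_products are hypothetical, and the strings simply repeat the notation of Theorems 1 and 2.

```python
# Hypothetical encoding of Table 1: (recombinase type, substrate) -> nontrivial products.
# The product strings repeat the notation of Theorems 1 and 2; nothing here is computed.
TABLE_1 = {
    ("tyrosine", "unknot"): ["T(2,n)", "C(2,n)"],
    ("tyrosine", "unlink"): ["T(2,2)"],  # the Hopf link
    ("tyrosine", "T(2,m)"): ["any member of the Figure 10 family"],
    ("serine", "unknot"): ["T(2,n)", "C(p,q)"],
    ("serine", "unlink"): ["T(2,n)"],
    ("serine", "T(2,m)"): ["any member of the Figure 10 family"],
}


def nontrivial_products(recombinase: str, substrate: str) -> list:
    """Return the nontrivial products predicted by the model for this pair."""
    key = (recombinase.lower(), substrate)
    if key not in TABLE_1:
        raise KeyError(f"no entry for {recombinase!r} acting on {substrate!r}")
    return TABLE_1[key]


if __name__ == "__main__":
    print(nontrivial_products("Tyrosine", "unlink"))
```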

[Figure 12 panel labels recovered from the image: trivial knot, trivial link, Subfamily 1,
Subfamily 2, T(2,2), T(2,p), T(2,p±1), T(2,p±2), C(p,±2)]

Figure 12. Products of recombination with tyrosine recombinases.



5. The minimal crossing number and our model 

5.1. Our family grows with n^3. The minimal crossing number of a DNA knot or link can
be determined experimentally using gel electrophoresis. However, there are 1,701,936 knots
with minimal crossing number less than or equal to 16 [13], and the number of knots and links
with minimal crossing number n grows exponentially as a function of n [10]. By contrast,
we will now prove that the total number of knots and links in our product family (Figure
10) grows linearly with n^3. Note that, while the knots and links in our family have at most
four rows containing p, q, r, and s signed crossings respectively, it does not follow that the
minimal crossing number of such a knot or link is |p| + |q| + |r| + |s|. If the knot or link is
not alternating, it is quite possible that the number of crossings can be significantly reduced. 
Thus there is no a priori reason to believe that the number of knots and links in our product
family should grow linearly with n^3.

Figure 13. Products of recombination with serine recombinases.

Recombinase Type    Substrate    Nontrivial Products
Tyrosine            unknot       T(2,n), C(2,n)
                    unlink       Hopf link = T(2,2)
                    T(2,m)       Any from Figure 10
Serine              unknot       T(2,n), C(p,q)
                    unlink       T(2,n)
                    T(2,m)       Any from Figure 10

Table 1. Non-trivial products predicted by our model.

We begin with some results about minimal crossing number which will be used in our 
proof. We shall denote the minimal crossing number of a knot or link K by MCN(K).

Lemma 2. Let |r| > 1 and |s| > 1. Then C(r, s) is equivalent to both K(r + 1, −1, s + 1) and
K(r − 1, 1, s − 1). Furthermore, if r and s have the same sign then MCN(C(r, s)) = |r| + |s| − 1,
and if r and s have opposite sign then MCN(C(r, s)) = |r| + |s|.

Proof. In Figure 14, we show that C(r, s) is ambient isotopic to K(r + 1, −1, s + 1) by moving
the highlighted strand in front of the diagram and then turning the horizontal row of s half-
twists so that they become vertical. Analogously, by moving the highlighted strand behind
rather than in front of the rest of the diagram, we see that C(r, s) is also ambient isotopic to
K(r − 1, 1, s − 1).

We evaluate MCN(C(r, s)) as follows. Murasugi [17] and Thistlethwaite [20] proved that 
any reduced alternating diagram has a minimal number of crossings. Observe that if r and 
s have opposite signs, then the standard diagram of C(r, s) is reduced and alternating. In 
this case, MCN(C(r, s)) = |r| + |s|. If r and s have the same sign and |r|, |s| > 1, then
either r, s > 1 or r, s < −1. If r, s > 1, then the diagram of K(r − 1, 1, s − 1) is reduced
and alternating, since all three rows of crossings are positive. In this case, MCN(C(r, s)) =
MCN(K(r − 1, 1, s − 1)) = r − 1 + 1 + s − 1 = |r| + |s| − 1. If r, s < −1, then the diagram of
K(r + 1, −1, s + 1) is reduced and alternating, since all three rows of crossings are negative. In
this case, MCN(C(r, s)) = MCN(K(r + 1, −1, s + 1)) = −(r + 1) + 1 − (s + 1) = |r| + |s| − 1. □

C(r,s)        K(r+1, −1, s+1)

Figure 14. An isotopy from C(r, s) to K(r + 1, −1, s + 1).
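
The formula of Lemma 2 is easy to evaluate mechanically. The following minimal sketch does so; the function name clasp_mcn is hypothetical, and the code only restates the lemma under its hypothesis |r| > 1 and |s| > 1 rather than computing anything from a diagram.

```python
def clasp_mcn(r: int, s: int) -> int:
    """Minimal crossing number of the clasp C(r, s) as stated in Lemma 2.

    Assumes |r| > 1 and |s| > 1, which is the hypothesis of the lemma.
    """
    if abs(r) <= 1 or abs(s) <= 1:
        raise ValueError("Lemma 2 requires |r| > 1 and |s| > 1")
    same_sign = (r > 0) == (s > 0)
    # Same sign: one crossing is saved by passing to a pretzel K(r -/+ 1, +/-1, s -/+ 1).
    # Opposite sign: the standard diagram is already reduced and alternating.
    return abs(r) + abs(s) - (1 if same_sign else 0)


if __name__ == "__main__":
    for r, s in [(3, 2), (3, -2), (-4, -3)]:
        print(f"MCN(C({r},{s})) = {clasp_mcn(r, s)}")
```

For example, clasp_mcn(3, 2) gives 4 while clasp_mcn(3, -2) gives 5, reflecting the single crossing saved when r and s have the same sign.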

To prove our theorem, we will also make use of the following theorem of Lickorish and 
Thistlethwaite. 

Theorem [16] Suppose that a knot or link L has a projection as in Figure 15 with k ≥ 3,
and for each i, R_i ∩ L is a reduced alternating projection which contains a crossing between
the two arcs at the bottom of R_i (as in Figure 15) and at least one other crossing. Then the
projection of L has a minimal number of crossings.

Figure 15. Each R_i is reduced, alternating, and has at least two crossings.

We shall adopt the language of Lickorish and Thistlethwaite and refer to a projection of 
the form described by their theorem as a reduced Montesinos diagram. Thus by the theorem, 
any projection of a knot or link which is a reduced Montesinos diagram has a minimal number 
of crossings. 

Theorem 3. The number of distinct knots and links in the product family illustrated in
Figure 10 which have MCN = n grows linearly with n^3.

Proof. We begin by fixing n, and suppose that K is a knot or link projection in the family of
Figure 10 which has MCN = n. This projection has |p| + |q| + |r| + |s| crossings. If the given
projection of K is reduced alternating or reduced Montesinos, then |p| + |q| + |r| + |s| = n.
Otherwise, we show that K is ambient isotopic to one of 24 possible projections which have a
minimal number of crossings. We will then show that there are at most 96n^3 possible knots
and links in our family with MCN = n.

The following example illustrates the type of strand move we shall use to reduce the number
of crossings whenever the diagram is neither reduced alternating nor reduced Montesinos.
Observe that the part of our knot or link consisting of the rows containing r and s crossings
is alternating if and only if r and s have opposite signs. If r and s have the same sign, then
by moving a single strand (as in Figure 16), this part of the knot or link becomes alternating.
This isotopy removes a crossing from both the r row and the s row and adds a single new
crossing. Thus we reduce this part of the diagram from having |r| + |s| crossings in a non-
alternating form to having (|r| − 1) + (|s| − 1) + 1 crossings in an alternating form. All of
the isotopies we use to get rid of unnecessary crossings involve moving at most three such
strands.




Figure 16. By moving a single strand we reduce from |r| + |s| crossings 
originally to (|r| — 1) + (|s| — 1) + 1 crossings in the alternating diagram. 

Next we will discuss the one exceptional case where we cannot obtain a reduced alternating 
or reduced Montesinos diagram by moving some strands of K. This is the case when K is 
a knot or link in our family with r > 1, p, q < −2, and s = 1. In its original form, the
projection has −p − q + r + 1 crossings. We can move a single strand of the diagram to obtain
a projection with only −p + (−q − 1) + (r − 1) + 1 crossings (illustrated on the left in Figure
17). We define a Hara-Yamamoto projection as one in which there is a row of at least two 
crossings which has the property that if this row is cut off from the rest of the projection 
and the endpoints are resealed in the two natural ways, then both resulting projections are 
reduced alternating. The projection on the left of Figure 17 is Hara-Yamamoto because the 
projections (on the right) obtained by resealing the endpoints are both reduced alternating. 
Hara and Yamamoto [14] have shown that any Hara-Yamamoto projection has a minimum 
number of crossings. Thus the projection on the left of Figure 17 has a minimal number of 
crossings. 
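
The crossing counts in the exceptional case can be checked with a few lines of arithmetic. The sketch below is only illustrative: the function name exceptional_case_crossings is hypothetical, and the parameter ranges are taken as printed above (r > 1, p, q < −2, s = 1).

```python
def exceptional_case_crossings(p: int, q: int, r: int) -> tuple:
    """Crossing counts before and after the single strand move, with s = 1.

    Assumes the ranges stated in the text: r > 1 and p, q < -2.
    Returns (original, after_move).
    """
    if not (r > 1 and p < -2 and q < -2):
        raise ValueError("expected r > 1 and p, q < -2")
    original = -p - q + r + 1                     # |p| + |q| + |r| + |s| with s = 1
    after_move = -p + (-q - 1) + (r - 1) + 1      # the Hara-Yamamoto projection
    return original, after_move


if __name__ == "__main__":
    before, after = exceptional_case_crossings(p=-3, q=-4, r=2)
    print(before, after)  # the strand move removes two crossings
```

In every admissible case the two counts differ by exactly 2, since the move deletes one crossing from the q row, one from the r row, and replaces the single s crossing by a single new crossing.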




Figure 17. If we cut off the row of p crossings on the left and reseal the 
strands in the two natural ways, then both resulting projections are reduced 
alternating. 

We will consider 27 cases according to the values of p, q, r, and s, and show that in all but
the above exceptional case K is isotopic to a diagram that is either reduced alternating or
reduced Montesinos and hence has minimal crossing number. Since there are so many cases,
we display the results in a table rather than discussing each case individually. We make the
following notes about the table. In the second column we list the form of the knot or link
which has a minimal number of crossings (e.g. reduced alternating). If the knot or link is
isotopic to a clasp, pretzel, or torus knot or link we will list the specific form (e.g. T(2, n)).
If the minimal crossing form is either a clasp C(r, s) or a pretzel of the form K(r, ±1, s) then
(according to Lemma 2) which one of these is the minimal crossing form depends on the
signs of r and s. In this case, we just list one of these two forms though the one we list is
not necessarily the form with the fewest crossings, as we do not know the signs
or specific values of the variables. In this case, for the MCN we write an expression with
(−1?) at the end to mean that depending on the relevant variables the MCN may be one
smaller. If one of these knots or links contains a trivial component, we use the shorthand +
O to indicate this in the table. We shall consider a knot or link and its mirror image to be
of the same link type, and hence we will not count both. Thus without loss of generality, we
shall assume that r ≥ 0. Also, observe that the rows of crossings containing p and q crossings
are interchangeable in Figure 10, so we treat the variables p and q as interchangeable. We
list the MCN as an unsimplified function of p, q, r, and s to help the reader recreate the
isotopy taking the original form to the minimal crossing form. Finally, apart from the cases
where K reduces to T(2, m) or C(2, m), we obtain the upper bounds for the number of links
in each case by expressing MCN = n as a sum of nonnegative integers. This enables us to
find an upper bound for the number of knots and links with MCN = n in each case. Note
that the upper bounds given are intended to be simple rather than as small as possible. In
particular, a number of our cases overlap, and thus some knots and links are counted more
than once. Also, for certain specific values of p, q, r, and s, we may obtain a trivial knot or
link. However, we do not specifically exclude these cases from our table.

There are 26 non-trivial cases in the table. However, all three instances of a T(2, m) yield
the same knot or link. Thus there are at most 24 distinct families of knots and links listed
in the table. The number of knots and links in each of these families is bounded above by
4n^3 (in fact, for most of the cases there are significantly fewer than 4n^3 knot and link types).
It follows that for a given n, the number of distinct knots and links in our product family
(Figure 10) which have MCN = n is bounded above by 24 × 4n^3 = 96n^3. In particular, the
number of distinct knots and links with the form of Figure 10 which have MCN = n grows
linearly with n^3.

□ 
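
The counting step in the proof above can be illustrated with a short sketch. It assumes a row of the table whose MCN has the shape a + b + c + 2 with three nonnegative summands (for example p + (r - 1) + (s - 1) + 2), and it treats the summands as arbitrary nonnegative integers, which can only overcount. The function names are hypothetical and are not part of the paper.

```python
from itertools import product
from math import comb


def count_nonneg_solutions(total, parts):
    """Count ordered tuples of `parts` nonnegative integers that sum to `total`."""
    if total < 0:
        return 0
    # Stars-and-bars formula for ordered nonnegative solutions.
    return comb(total + parts - 1, parts - 1)


def brute_force_count(total, parts):
    """Same count by direct enumeration, as a cross-check for small inputs."""
    if total < 0:
        return 0
    return sum(
        1
        for values in product(range(total + 1), repeat=parts)
        if sum(values) == total
    )


def row_count(n):
    """Ways to write n as a + b + c + 2 with a, b, c >= 0 (illustrative assumption)."""
    return count_nonneg_solutions(n - 2, 3)


for n in range(2, 12):
    assert row_count(n) == brute_force_count(n - 2, 3)
    # The count equals n(n - 1)/2, so it never exceeds n^2.
    assert row_count(n) <= n ** 2
```

Under these assumptions a row with three free summands contributes at most n^2 knots and links with MCN = n, and the same reasoning with four free summands gives a bound of order n^3.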

It follows from Theorem 3 that the proportion of all knots and links which are contained 
in our family decreases exponentially as n increases. Thus, for a knotted or linked product, 
knowing the MCN and that it is constrained to this family allows us to significantly narrow 
the possibilities for its precise knot or link type. The model described herein thus provides an 
important step in characterizing DNA knots and links which arise as products of site-specific 
recombination. 
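
As a rough numerical illustration of this comparison, the sketch below sets the bound 96n^3 against the standard tabulated numbers of prime knots with 3 to 16 crossings. These counts are not taken from this paper, although they agree with the values 7 and 49 quoted below for MCN = 7 and MCN = 9. Prime knots are only part of all knots and links, so the ratio computed here overestimates the true proportion. The names used are hypothetical.

```python
# Standard tabulated counts of prime knots by crossing number
# (a knot and its mirror image counted once). Not data from this paper.
PRIME_KNOT_COUNTS = {
    3: 1, 4: 1, 5: 2, 6: 3, 7: 7, 8: 21, 9: 49, 10: 165,
    11: 552, 12: 2176, 13: 9988, 14: 46972, 15: 253293, 16: 1388705,
}


def family_upper_bound(n):
    """Upper bound 96 n^3 on family members with MCN = n (proof of Theorem 3)."""
    return 96 * n ** 3


def bound_to_knot_ratio(n):
    """Ratio of the cubic bound to the number of prime knots with n crossings."""
    return family_upper_bound(n) / PRIME_KNOT_COUNTS[n]


def first_crossing_number_where_bound_is_smaller():
    """Smallest tabulated n at which 96 n^3 drops below the prime knot count."""
    for n in sorted(PRIME_KNOT_COUNTS):
        if family_upper_bound(n) < PRIME_KNOT_COUNTS[n]:
            return n
    return None


for n in sorted(PRIME_KNOT_COUNTS):
    print(n, family_upper_bound(n), PRIME_KNOT_COUNTS[n])
```

Because the bound 96n^3 is intentionally simple rather than sharp, this crude comparison only becomes informative at higher crossing numbers; for small n the finer case analysis of the tables is what restricts the possibilities.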

Values of p, q, r, s             Minimal crossing form   Strands  MCN written as a sum                Upper bound
for r > 0                                                moved    of nonnegative integers             on # of links

p = q = 0                        C(r, s) + O             0        r + |s| (-1?)                       4n
r = 0                            T(2, p + q)             0        |p + q|                             1
r = 1, p ≠ 0, q = 0              T(2, p)#T(2, s + 1)     0        |p| + |s + 1|                       2n
r = 1, p ≠ 0, q ≠ 0              K(p, q, s + 1)          0        |p| + |q| + |s + 1| (-1?)           8n^2
r > 1, p ≠ 0, q = 0              T(2, p)#C(r, s)         0        |p| + r + |s| (-1?)                 8n^2
r > 1, pq = -1                   T(2, r)                 0        r                                   1
r > 1, pq = 1, s = 0             C(±2, r)                0        2 + r (-1?)                         2
r > 1, p > 1, q = 1, s > 0       reduced alternating     1        p + (r - 1) + (s - 1) + 2           n^2
r > 1, p = q = 1, s < 0          reduced alternating     1        r + (-s - 1) + 2                    n
r > 1, p < -1, q = -1, s > 0     reduced alternating     2        -p + (r - 1) + (s - 2) + 2          n^2
r > 1, p, q < 0, s < 0           reduced alternating     0        -p - q + r - s
r > 1, p, q > 1, s = 0           reduced alternating     2        (p - 1) + (q - 1) + (r - 2) + 2     n^2
r > 1, p < -1, q > 1, s = 0      reduced alternating     1        -p + (q - 1) + (r - 1) + 1
r > 1, |p| > 1, |q| = 1, s = 0   C(r ± 1, p)             0        -p + (r ± 1) (-1?)                  4n
r > 1, qs = -1                   T(2, r + p ± 1)         1        |r + p ± 1|                         1
r > 1, p > 0, q = 1, s < 0       reduced alternating     1        p + r + (-s - 1) + 1                n^2
r > 1, p < -2, q = 1, s < -2     reduced alternating     1        (-p - 2) + r + (-s - 2) + 1         n^2
r > 1, p, q > 0, s = 1           reduced alternating     1        p + q + (r - 1) + 1                 n^2
r > 1, p < -1, q > 0, s = 1      reduced alternating     1        (-p - 1) + q + (r - 1)              n^2
r > 1, p < -1, q = 1, s > 1      reduced alternating     2        (-p - 1) + (r - 1) + (s - 1) + 2    n^2
r > 1, p > 1, q = -1, s < 0      reduced alternating     1        (p - 1) + r - s + 1
r > 1, p > 1, q = -1, s = 2      trivial                 2
r > 1, p > 1, q = -1, s > 2      reduced alternating              (p - 2) + (r - 1) + (s - 3) + 2     n^2
r > 1, |p|, |q| > 1, s < 0       reduced Montesinos      0        |p| + |q| + r - s                   4n^3
r > 1, |p|, |q| > 1, s > 1       reduced Montesinos      1        |p| + |q| + (r - 1) + (s - 1) + 1   4n^3
r > 1, p < -1, q = -2, s = 1     K(p, 2, r - 1)          1        -p + 2 + (r - 1)                    n
r > 1, p, q < -2, s = 1          Hara-Yamamoto           1        -p + (-q - 1) + (r - 1)

Table 2. The minimal crossing forms of links in the family in Figure 10 

5.2. Products whose MCN is one more than the substrate. Finally, we prove a more 
directly applicable theorem as follows. Site-specific recombination often adds a single crossing 
to the MCN of a knotted or linked substrate. If the substrate is T(2,m) and the product of 
a single recombination event has MCN = m + 1, then we can further restrict the resulting 
knot or link type. 

Theorem 4. Suppose that Assumptions 1, 2, and 3 hold for a particular recombinase-DNA 
complex with substrate T(2,m), with m > 0. Let L be the product of a single recombination 
event, and suppose that MCN(L) = m + 1. Then L is either T(2, m + 1), C(-2, m - 1), or 
K(s, t, 1) with s, t > 0 and s + t = m (see Figure 18). 
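
The conclusion of Theorem 4 is easy to enumerate for a given m. The following minimal sketch lists the allowed forms as strings. The function name is hypothetical, and listing only s ≤ t is an assumption based on the interchangeability of the p and q rows noted earlier. The output is a list of forms, not a list of distinct knot or link types: some forms may be links rather than knots, and different forms may represent the same type.

```python
def theorem4_candidates(m):
    """List the product forms permitted by Theorem 4 for substrate T(2, m)."""
    if m <= 0:
        raise ValueError("Theorem 4 is stated for m > 0")
    candidates = [f"T(2, {m + 1})", f"C(-2, {m - 1})"]
    # K(s, t, 1) with s, t > 0 and s + t = m.
    # Assumption: K(s, t, 1) and K(t, s, 1) are treated as the same form,
    # so only s <= t is listed.
    for s in range(1, m // 2 + 1):
        t = m - s
        candidates.append(f"K({s}, {t}, 1)")
    return candidates


print(theorem4_candidates(6))
```

For m = 6 the list includes T(2, 7) and K(3, 3, 1), two of the forms that appear in the application to Xer recombination discussed after the proof.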

Proof. For m = 1, T(2, 1) is the trefoil knot 3_1, and hence L must be the figure eight knot 
4_1, which can also be written as K(2, 1, 1). Thus from now on we assume that m > 2. By 
Assumption 1, there is a projection of J such that B ∩ J has at most one crossing. Since 
J = T(2, m), the proof of Lemma 1 shows that C ∩ J is ambient isotopic, fixing ∂B, to a 
projection with Form C2, C3, or C4 (see Figure 8). Furthermore, when C ∩ J has Form C4, 
then p + q = m. By Assumption 3, after a single recombination event with either serine or 
tyrosine recombinases the post-recombinant form of B ∩ J is ambient isotopic, fixing ∂B, to 
one of those illustrated in Figure 7. Thus any knotted or linked product L has one of the 
forms illustrated in Figure 12. 

Figure 18. These are the only possible products, if the substrate is T(2,m) 
and the product has MCN = m + 1. 

First suppose that L has one of the forms illustrated when C ∩ J has Form C2 or C3. We 
see that L cannot be T(2, m)#T(2, 2), since MCN(T(2, m)#T(2, 2)) = m + 2. Certainly, L 
cannot be T(2, m) with a trivial component. If L = T(2, n) then n = m + 1, so we are done. 
If L = C(-2, n) and n > 1, then n = m - 1, so again we are done. If L = C(2, n) and n > 1, 
then L = K(1, 1, n - 1). In this case n = m, and again we are done. 

Now suppose that L has one of the forms illustrated when C ∩ J has Form C4. If L = 
K(p, q, a) for some value of a, then L has a projection in the product family illustrated in 
Figure 10 with r = 1. Otherwise, L is a member of the product family with r = ±2. However, 
if r = -2, we can turn over the top loop to get r = 2 (this will also add one positive crossing 
to the s row). Thus we shall now assume that L has a projection in the product family (i.e., 
Figure 10) with either r = 1 or r = 2. Table 2 lists all of the nontrivial knots and links 
in this family when r > 0. Thus all of the products that we are considering occur in Table 
2. We would like to know which of the cases in Table 2 have r = 1 or 2, p + q > 2, and 
MCN = p + q + 1. Table 3 answers this question. 

The only subtle case in the table is where L = C(r ± 1, p). In this case we must have 
r = 2, |q| = 1, and p + q > 2. It follows that p > 1. Since L is non-trivial, we must have 
L = C(3, p) = K(2, 1, p - 1). Now MCN(L) = m + 1 implies that p - 1 + 2 = m = p + q. 
Thus L = K(s, t, 1) where s + t = p + q. Now from Table 3, we can see that if r = 1 or 2, 
p + q > 2, and MCN(L) = p + q + 1, then L is either T(2, m + 1), C(m - 1, -2), or K(s, t, 1) 
with s, t > 0, and s + t = m. □ 
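
Individual entries of Table 3 can be checked by brute force. As an illustration, the sketch below takes the row whose MCN is (p - 1) + (q - 1) + (r - 2) + 2, reading its conditions as r > 1 and p, q > 1 (an assumption about the row's conditions), and searches a small range for the values of r at which the MCN equals p + q + 1. The function names and the search limit are hypothetical.

```python
def mcn_row(p, q, r):
    """MCN formula (p - 1) + (q - 1) + (r - 2) + 2 from one row of Table 3."""
    return (p - 1) + (q - 1) + (r - 2) + 2


def r_values_with_one_extra_crossing(limit=10):
    """Values of r in 2..limit for which MCN = p + q + 1 for some p, q in 2..limit."""
    hits = set()
    for p in range(2, limit + 1):
        for q in range(2, limit + 1):
            for r in range(2, limit + 1):
                if mcn_row(p, q, r) == p + q + 1:
                    hits.add(r)
    return hits


print(r_values_with_one_extra_crossing())
```

The search finds only r = 3, in agreement with the table entry "only if r = 3". Since the proof restricts attention to r = 1 or 2, this row contributes no products.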

We now illustrate an application of Theorem 4 (further applications of our model are 
discussed in [2]). Bath et al. used the links 6^2_1 and 8^2_1 as the substrates for Xer recombination, 
yielding a knot with MCN = 7 and a knot with MCN = 9, respectively. These products have 
not been characterized beyond their minimal crossing number, and MCN is not sufficient to 
determine the knot type. In particular, there are seven knots with MCN = 7 and 49 knots 
with MCN = 9. 

Theorem 4 significantly reduces the number of possibilities for each of these products. In 
particular, it follows from Theorem 4 that the 7-crossing products of Xer must be 7_1 = T(2, 7), 
7_2 = C(5, -2) or 7_4 = K(3, 3, 1); and the 9-crossing products of Xer must be 9_1 = T(2, 9), 
9_2 = C(7, -2), or 9_5 = K(5, 3, 1). All of these possible products are actually 4-plats. This 
example shows how our model complements the work of [8], which restricts attention to the 
tangle model, and thus assumes that all products are 4-plats. In [1], building on earlier work 
of [4, 5, 8, 21], we use our model together with tangle calculus to completely classify all tangle 
solutions to these Int-Xer equations. 

Values of p, q, r, s             Minimal crossing form   MCN written as a sum                Is r = 1 or 2   Can MCN =
for r > 0                                                of nonnegative integers             and p + q > 2?  p + q + 1?

p = q = 0                        C(r, s) + O             r + |s| (-1?)                       no              -
r = 0                            T(2, p + q)             |p + q|                             no              -
r = 1, p ≠ 0, q = 0              T(2, p)#T(2, s + 1)     |p| + |s + 1|                       yes             no
r = 1, p ≠ 0, q ≠ 0              K(p, q, s + 1)          |p| + |q| + |s + 1| (-1?)           yes             if s = 0
r > 1, p ≠ 0, q = 0              T(2, p)#C(r, s)         |p| + r + |s| (-1?)                 yes             no
r > 1, pq = -1                   T(2, r)                 r                                   no              -
r > 1, pq = 1, s = 0             C(±2, r)                2 + r (-1?)                         yes             if r = p + q - 1
r > 1, p > 1, q = 1, s > 0       reduced alternating     p + (r - 1) + (s - 1) + 2           yes             no
r > 1, p = q = 1, s < 0          reduced alternating     r + (-s - 1) + 2                    yes             no
r > 1, p < -1, q = -1, s > 0     reduced alternating     -p + (r - 1) + (s - 2) + 2          no              -
r > 1, p, q < 0, s < 0           reduced alternating     -p - q + r - s                      no              -
r > 1, p, q > 1, s = 0           reduced alternating     (p - 1) + (q - 1) + (r - 2) + 2     yes             only if r = 3
r > 1, p < -1, q > 1, s = 0      reduced alternating     -p + (q - 1) + (r - 1) + 1          yes             no
r > 1, |p| > 1, |q| = 1, s = 0   C(r ± 1, p)             -p + (r ± 1) (-1?)                  yes             yes
r > 1, qs = -1                   T(2, r + p ± 1)         |r + p ± 1|                         yes             yes
r > 1, p > 0, q = 1, s < 0       reduced alternating     p + r + (-s - 1) + 1                yes             no
r > 1, p < -2, q = 1, s < -2     reduced alternating     (-p - 2) + r + (-s - 2) + 1         no              -
r > 1, p, q > 0, s = 1           reduced alternating     p + q + (r - 1) + 1                 yes             no
r > 1, p < -1, q > 0, s = 1      reduced alternating     (-p - 1) + q + (r - 1)              yes             no
r > 1, p < -1, q = 1, s > 1      reduced alternating     (-p - 1) + (r - 1) + (s - 1) + 2    no              -
r > 1, p > 1, q = -1, s < 0      reduced alternating     (p - 1) + r - s + 1                 yes             no
r > 1, p > 1, q = -1, s = 2      trivial                                                     yes             no
r > 1, p > 1, q = -1, s > 2      reduced alternating     (p - 2) + (r - 1) + (s - 3) + 2     yes             no
r > 1, |p|, |q| > 1, s < 0       reduced Montesinos      |p| + |q| + r - s                   yes             no
r > 1, |p|, |q| > 1, s > 1       reduced Montesinos      |p| + |q| + (r - 1) + (s - 1) + 1   yes             no
r > 1, p < -1, q = -2, s = 1     K(p, 2, r - 1)          -p + 2 + (r - 1)                    no              -
r > 1, p, q < -2, s = 1          Hara-Yamamoto           -p + (-q - 1) + (r - 1)             no              -

Table 3. Which products can have MCN = p + q + 1? 